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Abstract 

We define the arithmetic self-similarity (AS) of a one-sided infinite se- 
quence a to be the set of arithmetic subsequences of a which are a vertical 
shift of a. We study the AS of several families of sequences, viz. com- 
pletely additive sequences, Toeplitz words and Keane's generalized Morse 
sequences. We give a complete characterization of the AS of completely 
additive sequences, and classify the set of single-gap Toeplitz patterns that 
yield completely additive Toeplitz words. We show that every arithmetic 
subsequence of a Toeplitz word generated by a one-gap pattern is again 
a Toeplitz word. Finally, we establish that generalized Morse sequences 
are specific sum-of-digits sequences, and show that their first difference is 
a Toeplitz word. 

1 Introduction 

Some infinite sequences are similar to a part of themselves. Zooming in on a part 
of the structure reveals the whole structure again. Of special interest are what 
we may call scale-invariant sequences. A sequence w = (w{n))n>i (over some 
additive group S) is scale-invariant if for all 'dilations' k > 1 the compressed 
sequence w/k = {w{kn))n>i is a 'vertical shift' of tr, that is, w{kn) — w{n) 
is constant. An example of a scale-invariant sequence is the period doubling 
sequence p which can be defined as p = {v2{n) mod 2)„>i with V2{n) the 2-adic 
valuation of n. We have that p/k — p-\- p{k) mod 2 for every A; > 1: 

p = 0100010101000100010001010100010101000101010... 
p/2 = 10 1110 10 10 1110 1110 111... 
p/3 = 01000101010001... 
p/5 = 1 1 1 ... 



*This research has been partially funded by the Netherlands Organisation for Scientific 
Research (NWO) under grant numbers 612.000.934 and 639.021.020. 
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Scale-invariant sequences a with first term cti = are known as completely 
additive sequences, that is, sequences cr such that a{nm) = cr(n) + cr(m) for all 
positive integers n, m. 

In general, given an infinite sequence a one may wonder what are the arith- 
metic subsequences of a similar to a itself. This leads to the notion of what 
we call 'arithmetic self-similarity': For an equivalence relation ~ on S", the 
arithmetic self-similarity of a sequence cr G E'^ , which we denote by AS^ (cr) , is 
the set of pairs (a, b) such that the subsequence of cr indexed by the arithmetic 
progression a + bn is equivalent to cr : 

^iS~(cr) — { (a, b) I aa,b ~ cr } where aa.b — o'(a) ^(a + b) a{a + 2b) . . . 

For instance, in [12] it is shown that the arithmetic self-similarity of the Thue- 
Morse sequence m = 01101001 • • • [3] with respect to 'transducer-equivalence' 
o is the full space AS^{m) = N x N>o, i.e., for every arithmetic subsequence 
ma,b of m there is a finite state transducer which reconstructs m from ma,b- 

In this paper we focus on cyclic groups (E, -|-,0) and on a particular equiv- 
alence ~ on sequences over E, namely cr ~ t if and only if cr = t + c for some 
c G E. Our main results are as follows: 

- We give a complete characterization of the arithmetic self-similarity of 
completely additive sequences a G ^^>o ; we show that (a, b) G AS{a) if 
and only if a = 6. 

- We define Toeplitz patterns to be finite words over EU(3s, where ©s is the 
set of permutations over E which play the role of 'gaps'. The composition 
operation defined on Toeplitz patterns (Definition 4.4) forms a monoid 
(Proposition 4.6). 

- We give a complete characterization of one-gap Toeplitz patterns that 
yield completely additive sequences (Theorem 5.13). These patterns are 
constructed using discrete logarithms. The completely additive sequences 
they generate are determined by infinite sets of primes. 

- We show that every arithmetic subsequence of a Toeplitz word generated 
by a one-gap pattern is again a Toeplitz word (Theorem 6.9). 

- We prove that the first difference of a generalized Morse sequence [16] is 
a Toeplitz word (Theorem 7.8). This gives rise to an embedding from 
Keane's monoid of block products (extended to additive groups) to that 
of Toeplitz pattern composition (Theorem 7.7). 

- We show how generalized Morse sequences [16] are specific sum-of-digits 
sequences (Theorem 7.5). 

- For the Thue-Morse sequence m = 01101001 ... we show that m{a-\-bn) = 
m{n) + m{a) mod 2 (n G N) if and only if < a < 6 = 2™ for some m G N 
(Theorem 7.10). 
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2 Basic Definitions 



We write N = {0, 1,2,.. .} for the set of natural numbers, and N>o for the set of 
positive integers, N>o = N \ {0}. For fc > we define = {0, . . . , fc - 1}. The 
sets of finite words and finite non-empty words over an alphabet S are denoted 
by S* and S+, respectively. We identify a (one-sided) infinite word with a map 
from N to S, or, when this is more convenient, from N>o to S. We write S^, 
or S^x), for the set of infinite words over S: 

S" = {ct I aiN^S} 5]">° = {a I c7:N>o^S} 

We let denote the set of all finite and infinite words over S. We write e 
for the empty word, and xy for concatenation of words a; € S*, y S For 
M e S* we let vP = e and = uu'^, and, for u G S+ we write for the 
infinite sequence mm . . .. For a word w G we use w{rL) to denote the value 
at index n (if defined). The reversal oi w € 'S* is defined by = e and 
{au)^ = u^a, for all a G S and u gT,*. 

We shall primarily deal with infinite words over finite cyclic groups. A cyclic 
group is a group generated by a single element (or its inverse) . Every finite cyclic 
group of order k is isomorphic to the additive group (Sfc, +, 0) with + denoting 
addition modulo k. 

Let S = (S,-|-e,Oe) be a finite cyclic group. Let w G and a G S. 

Then w -He a is defined by {w +e o,){n) = w{n) +■£ a, whenever w{n) is defined. 
Furthermore, we write a +y: t for the sequence obtained from ct, r G by 
pointwise addition: {a+sT){n) = a{n)+Y,T(n) for alln G N; so a+sa = a+Y,a^ ■ 
When the group S is clear from the context, we will write just + and for +e 
and Os. 

For G S„ (or G N) and c G (with n,m G N>o), we write fc c for the 

letter c + cH he (fc times) in the alphabet E^- For words w = ao . . . G S* 

we use li; c to denote the word (ao 0c)... (a^ c) G SJj^. Likewise for 
infinite sequences a = a^ai . . . G we write a c for (ao c)(oi c) . . . G S^. 
Finally, by the partial application 0c we denote the map from I]„ to defined 
by fc i->^ fc c for all fc G S„. For n, m G N>o we define 

, m lcm(n, m) 

C(n,m)- - 



gcd(n, m) n 

Lemma 2.1. Let c G s.t. c is divisible by ^(n, m), i.e., c = ^(n, m) c' for 
some c' G Eto- Then the map 0c is a group homomorphism from (E„, -|-e„, Os„) 

Proo/. Obviously Oe„ c = Os„. Note that n c = n Q {^{n,m) G c') = 
(n ■ i;(/7..m)) c' = Oe^^ since n ■ S^{n,m) is a multiple of m. Let k,£ G Sn 
arbitrary. If fc i <n (here +n indicates that we use addition on the natural 
numbers), then trivially (fc+s„^)0c = fc0c+E„-^0c. Otherwise fc+N-^ = n+^u 
for some u G E„ and kQc+^^iQc = nQc+^^uQc = uQc = (fc-|-s„-^)0c. □ 
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Let cr S S be an infinite sequence, and let a > and 6 > 1 be integers. The 
arithmetic subsequence Ua,b of cr is defined by^ 

<ya,b{n) ~ a-(a + bn) {n S N) 

By composition of arithmetic progressions we have {<Ja,b)c,d = Ca+bc.fcd for 
all a, c > and 6, d > 1. 

A sequence a € is ultimately periodic if there are no G N and t G N>o 
such that a{n + t) = cr(ri) for all n > hq. Here t is called the period and uq the 
preperiod. If rio = 0, cr is called (purely) periodic. 

Lemma 2.2. Every arithmetic subsequence of an ultimately periodic sequence 
is ultimately periodic. 

Proof. Let a e T,^' , e N, t e N>o with (Vn > na){a{n + 1) = cr{n)). Further 
let a G N, 6 € N>o- We show that aa,b is ultimately periodic. Let tiq G N be 
such that a+buQ > no, and let t' G N>o with bt' = (mod t) (such t' with t' < t 
always exists). Then we have aa,bin + t') ~ a{a + bn + bt') — a{a + bn) — CTa,6(n) 
for all integers n > u'q. □ 

We use P = {2, 3, 5, . . .} for the set of prime numbers. We write [a]k for the 
congruence (or residue) class of a modulo k: [a]fc = {n G N | n = a (mod k)}. 
The set of residue classes modulo k is denoted by Z/fcZ. Moreover, for the union 
of [ai]k, . . . , [aq\k G IjjkTL we write 

[ai, . . . ,aq]fe [ai]fc U • • • U [a^^ 

We let Zfc = {'L/klj)'^ = {VAk \ gcd{h,k) — 1}, the multiplicative group of 
integers modulo k. A primitive root modulo k is a generator of the group Zfc. 
That is, a primitive root is an element g of Z^ such that for all a € Zk there is 
some integer e with a = g'^ (mod k). For integers ei, 62 we have that g"^^ = g'^'^ 
(mod k) if and only if ei = 62 (mod fc), and so we have a bijection e n> (^'^ on 
Zfc, called discrete exponentation (with base g). Its inverse, a 1— >■ logg(a) = e 
with e such that a = g'^ (mod k), is called discrete logarithm (to the base g). 

Groups Ijp — { , [2]p , . . . , [p— l]p} ~ {1, 2, . . . ,p— 1} with p an odd prime 
number contain all nonzero residue classes modulo p, always have a primitive 
root and are cyclic. For example the primitive roots modulo 7 are 3 and 5. The 
powers of 3 modulo 7 are 3, 2, 6, 4, 5, 1, 3 ... , and of 5 they are 5, 4, 6, 2, 3, 1,5,.. ., 
listing every number modulo 7 (except 0). On the other hand, 2 is not a primitive 
root modulo 7: the powers of 2 modulo 7 are 2, 4, 1,2,.. ., missing several values 
from {1, . . . , 6}. Note that a primitive root is not necessarily a prime number, 
e.g., 6 is a primitive root modulo 11. 

Lemma 2.3. 

(i) (Base change) Let g, h be primitive roots mod n, a (z Z„. Then logg{a) = 
loggih) ■ log;, (a). 

^For infinite sequences cr G E'*'>o (indexed over the positive integers) we require both a 
and b to be positive integers, and then i,{n) = a{a + b{n — 1)) for all n G N>o. 
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(ii) (Fermat's little theorem) Let p he a prime number and a G Zp. Then 
aP^^ = 1 (mod p). 

(Hi) Letp > 2 be prime and g be a primitive root modulop. Then g^^"^^/"^ = — 1 
(mod p). 

Proof. Folklore. We only prove item (ill): by g being a primitive root we know 
g^ = —1 (mod p) for some unique 1 < fc < p — 1, and so g^^ = 1 (mod p) which 
in turn implies 2k = p — 1 (mod p) by item (ii). □ 

Arithmetic Self-Similarity 

Throughout the paper we fix an arbitrary finite cyclic group (I],+,0}. 

We define the 'arithmetic self-similarity' of an infinite sequence a, as the set 
of pairs (a, 6) such that the arithmetic subsequence aa,b is similar to a, where 
we interpret 'similar to' as 'is a vertical shift of. 

Definition 2.4. Let the relation ~ C S^* x S^* be defined by 

(7 ~ T if and only if a = t + c for some c G S (cr, t G S^) 

Then, we define the arithmetic self- similarity of a sequence cr G S'^ by 

AS{a) = { (a, b) I aa,b ~ o- } 

Clearly the relation ~ is an equivalence relation on E'*' as the elements of 
the group S are invertible. Equivalent sequences have the same arithmetic self- 
similarity. 

Lemma 2.5. For all cr, r G S'^, if a t then AS{a) = AS{t). □ 

Arithmetic self-similarity is closed under composition of aritmetic progres- 
sions. 

Lemma 2.6. Let a G S'* he an infinite sequence and assume {a,b) , {c, d) G 
AS {a). Then also {a + be, bd) G AS{a).'^ □ 

If for some a > 1 the shift cra^i of a sequence cr G is similar to cr, then a 
is periodic. 

Lemma 2.7. Let a G and a> 1. If (a, 1) G ^iS(cr), then a is periodic.^ □ 

^For sequences u G E'*'>'', indexed by the positive integers, the statement has to be refor- 
mulated: for all a, b,c,de N>o, if (a, b), {c,d) e AS{ct), then (a + b{c - 1), bd) € AS(cr). 
■^For sequences a G E'*'>o this reads: if a > 2 and (a, 1) £ AS{a), then a is periodic. 
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3 Completely Additive Sequences 

In this section we investigate the arithmetic self-similarity of completely additive 
sequences. 

Definition 3.1. A sequence a 6 yF^>o is completely additive (with respect toT,) 
if it is a homomorphism from the multiplicative monoid (N>o,-,l) of positive 
integers to the additive group (E, + , 0), that is 

(t(1) = f7{nm) — a{n) + a{m) (n, m £ N>o) 

The constant zero sequence z £ Y]^>" , defined by z{n) — 0^, {n & N>o), is called 
trivially additive, or just trivial. 

Lemma 3.2. Every sequence that is both completely additive and ultimately 
periodic is trivial. 

Proof. Let cr G Y:^>a be a completely additive sequence. First we assume ct 
to be purely periodic with period t G N>o. Then for all n e K(>o we have 
a{nt) = a{n) + a{t) by complete additivity and a{nt) = a{t) by i-periodicity, 
whence (j{n) — 0. 

Now let a be ultimately periodic with period t G N>o and preperiod uq G 
N>o, i.e., a{n + t) — (j{n) for all integers n > rig. It is clear that the arithmetic 
subsequence (t/uo — a{no)a{2no) ... is periodic (see Lemma 2.2). By complete 
additivity we know a{nno) = a{n) + cr(no), and so a is periodic as well. Then, 
by the observation above, we conclude that a is the constant zero sequence. □ 

Completely additive sequences are uniquely determined by their values at 
prime number positions, i.e., by a map /i : P — > S, as follows. First we recall the 
definition of the p-adic valuation of an integer n > 1, that is, the multiplicity of 
prime p in the prime factorization of n: 

Definition 3.3. Let p be a prime number. The p-adic valuation of n G N>o is 
defined by 

Vp (n) = max { a G N | p° divides n } 

The infinite sequence Vp ~ Vp(l) Vp{2) ... is completely additive (with re- 
spect to the cyclic group Z). 

Definition 3.4. Let /i : P ^ S and define the sequence t;^ G E^>'' for all 
n G N>o by 

(Here summation is the generalized form of addition of the group S.) If the set 
{p G P I fi{p) =/= 0} is finite, then v^^ is called finitely prime generated. Otherwise 
Vf^ is called an infinitely prime generated sequence. 
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Table 1: An instance of Lemma 3.7: V2.3 = V2 + v^. 



Every completely additive sequence a has a generator /i : P — > E, viz. 
fi = cr|p, the domain restriction of the function a : N>o — i- S to the set of 
primes. Hence, for every finite cyclic group S with > 2, the set of completely 
additive sequences over S has the cardinality of the continuum. 

Example 3.5. Define ^ : P ^ S4 by fi{p) = 0,1,3,2,1 ii p = 1,2,3,4,0 
(mod 5), respectively. Then = 0132101322 01320 01323 01322... is gener- 
ated by the infinite set {p \ ^i{p) 7^ 0} = P n [2, 3, 4]5 U {5}. 

If E = E2 = {0, 1} then /i : P — !> E2 is the characteristic function of a set X C 
P, and we simply write vx{n) (= X^pGX ^pC'^) mod 2). We sometimes write Vx 
where X C N to denote vpnx- For instance v^a]^ denotes the completely additive 
sequence generated by the set of prime numbers congruent to a modulo 5, which 
is infinite if gcd(a, 6) = 1 by [10]. 

Example 3.6. The sequence t = C"(0) with C : I]2~^^2 the morphism given by 
C(0) — 010 and C(l) = Oil, is the sequence of turns of the Terdragon curve [9] 
([17, A080846]). From Theorem 5.13 (and Proposition 4.13) it follows that i is a 
completely additive sequence, generated by the infinite set of primes congruent 
to 2 modulo 3, i.e., t — i'[2]3- 

Lemma 3.7. Let ^1,^2 : P S2- Then v^^+^.^ v^^ +v^2. □ 

For sets A,B CP Lemma 3.7 says vaab = va + i'b, where A is symmetric 
diflFerence: AAB^ {AUB)\{AnB) = (A \ B) U (B \ A), and so we have 
vaub — va + vb if^ni? = 0. See for example Table 1, where the 2 and 3-adic 
valuation sequences in base 2 are added modulo 2. 

If w G lf>° is additive, it is clear that {b,b) e AS{w) for all b e N>o. 
The question arises whether the arithmetic self-similarity of w possibly con- 
tains other progressions. This is not the result due to Kevin Hare and 
Michael Coons [13]. 

Theorem 3.8. Let w G T,^>" be a non-trivial completely additive sequence. 
Then: 

ASiw) = {{b,b) |5eN>o} 

Proof (K. Hare, M. Coons [13]). 

(3) Let w be a completely additive sequence over S, and let b G N>o- For all 
n G N>o we have Wb,b{n) = w{bn) = w{n) + w{b). Hence Wb,b ~ w and so 
{b,b)eASiw). 
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(C) Let u; be a completely additive sequence over S and assume (a, b) £ AS{w) 
with a^h. We will show that w is trivially additive, i.e., w{n) = for all 
n e N>o. 

Claim 1. Without loss of generality we may assume |a — 6| = ±1. 

Proof of Claim 1. By the assumption (a, b) G AS{w) we have 

Wa,b{n) = w{a + b{n — 1)) = w{bn + a — b) ^ w{n) + c (n £ N>o) 

for some c G S. So in particular it holds for n — \a — b\ ■ N , where N is 
some arbitrary positive integer. This implies that 

w{b-\a-b\- N + a-b) = w{\a - b\) + w{N) + c 

but 

w{b-\a-b\-N + a-b)^ w{\a - b\) + w{bN ± 1)) 
Subtracting u;(|a — b\) from both right-hand sides, we have 

w{bN + sign(a - 6)) = w{N) + c (a > 6) 

which concludes the proof of Claim 1. □ 

We employ the techniques used in the proof of Theorem 4 in [5] for showing 
that in both cases w is trivially additive. 

Case 1. If there is some c € E such that w{bn + 1) = w{n) + c for all 
n S N>o, then w is trivial. 

Proof of Case 1. Let c e S be such that w{bn + 1) = w{n) + c for all n. 
We will show by induction that 

w{bn + i) ^ w{n) + c (n £ N>o) (t) 

for all 1 = 1,2,.... Hence, taking n = 1, the sequence w is constant from 
position 6+1 onward. With Lemma 3.2 it then follows that w is trivially 
additive. 

For i — 1 equation (J) holds by assumption. Now assume (|) holds for 
I = 1, 2, . . . , J. Then we find: 

w{bn + j) + w{bn + 1) = 2 • w{n) + 2c 

and 

wibn + j) + w{bn + 1) = w{{bn + j){bn + 1)) 

= w{b'^n'^ + bn{j + 1) + j) 
= w{b{br? + n{j + I)) + j) 
= w{bn'^ + n{j + 1)) + c 
= w{{bn + j + l)n) + c 
= w{bn + j + 1) + w(n) + c 
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Hence w{bn + j + 1) = w{n) + c and the result follows by induction. This 
concludes the proof of Case 1. □ 



Case 2. If there is some c € S such that w{bn + 1) = w{n) + c for all 
n S N>o, then w is trivial. 

Proof. Assume, for some c G S that w{bn — 1)) = w{n) + c for all integers 
n e N>o. Then 

w{b{bn^) - 1)) = w{{bn - l){bn + 1)) 

= wibn — 1) + w{bn + 1) 
= w{n) + c + w{bn + 1) 

and 

w{b{bn^) - 1)) ^ w{bn^) + c 

— w(b) + 2 • w{n) + c 

Hence w(6n+ 1) = w{n) +w{b), for all n G N>o. By Case 1 it then follows 
that w is trivially additive. □ 

To establish the direction C, we have shown that if w is completely additive 
and (a, b) G AS{'w) for some a ^ b^ then w is trivially additive. □ 

Remark 3.9. We were pleasantly surprised to receive an e-mail from Kevin 
Hare with the beautiful proof of Theorem 3.8 [13]. At the time we had some 
partial results: we had a proof of the statement for finitely prime generated 
sequences cr, and also for some specific infinitely prime generated sequences, 
namely those produced by one-gap Toeplitz patterns (see Sections 4 and 5). 
For these partial results we refer to the first version of the current arXiv report, 
available at the following url: http://arxiv.org/pdf/1201.3786vl. Finally, 
we had also established the statement for completely additive sequences over 
the infinite cyclic group Z. 

4 Toeplitz Words 

Toeplitz words were introduced in [15], see also [1, 6]. A Toeplitz word over 
an alphabet E is an infinite sequence iteratively constructed as follows: Given 
is a starting sequence ctq over the alphabet S U {?}, where we may think of 
the symbol ? ^ S as 'undefined', and of ctq as a 'partially defined' sequence. 
The occurrences of ? in a sequence a are called the 'gaps' of a. For i > 
the sequence (Ji is obtained from o-i_i by 'filling its sequence of gaps' (i.e., 
consecutively replacing the occurrences of ? in Gi-i) by the sequence Oi-i itself, 
as made precise in Definition 4.1 below. In the limit we then obtain a totally 
defined sequence (i.e., without gaps) if and only if the first symbol of the start 
sequence (Tq is defined (i.e., is in S). 
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As in [1] we allow the application of any bijective map / : E — > E to the 
symbols that replace the gaps. Thus we let the above Oi be sequences over 
E U ©E where Ss denotes the symmetric group of bijections (or permutations) 
on E, and let the elements from ©s play the role of gaps. Filling a gap / 
with a letter a £ E then results in /(a), and filling a gap / by a gap g results 
in the function composition fog. Viewed in this way the symbol ? stands 
for ids, the identity element of ©s, and we will use it in that way. From the 
finiteness of the alphabet E and the /'s being one-to-one it directly follows that 
extending the set of Toeplitz patterns does not increase the expressive power 
of the system [1]: every Toeplitz word generated by a pattern with gaps from 
©E can already be defined using a (longer) pattern with gaps ? only. In other 
words, it is a conservative extension. 

Definition 4.1. For infinite words ct, r G (EU©s)'^>'' we define a[T] recursively 

by 

{ax)[y] ^ ax[y] {fx)[by]^f{b)x[y] {f x)[g y] ^ {f o g) x[y] 

where a,b<^Y., f,ge ©s, and x,y <E (E U ©s)'*'>''. Further let P G E(EU©s)* 
(first symbol not a gap) and, for fc = 0, 1, 2, . . ., define Tfe(P) e (E U &t.T>" by 

To(P) = r = V.l... Tk+i{P) = P^[Tu{P)] 

Then T{P) G E^>", the Toeplitz word generated by P, is defined as the limit of 
these words, as follows: 

r(F) = lim Tfe(P) 

We let |P|7 = \{h I P{h) G ©e}| denote the number of gaps in P, and, follow- 
ing [6], we call T{P) a Toeplitz word of type (r, q) when r = \P\ and q = |P|?. 

Example 4.2. Let P ~ 0?1? G (E2 U ©22)*- The sequence of 'P-generations' 
To{P),Ti{P), . . . starts as follows: 

1'^, (0?1?)", (0017011?)'^, (001001170011011?)", ... 

The limit of this sequence of sequences is the well-known regular paperfolding 
sequence: r(P) = /, see [1]; / is additive and generated by the (infinite) set of 
prime numbers congruent to 3 modulo 4. 

Proposition 4.3. The set (E U ©s)*^^" with the operation {a, r) 1-^ a[T] and 
identity element 7" forms a monoid. 

The Toeplitz word T{P) is the unique solution of x in the equation x = 
P"[a;]. The construction can thus be viewed as 'self-reading' in the sense that 
the sequence under construction itself is read to fill the gaps. For example, 
the period doubling sequence p is the Toeplitz word over E2 generated by the 
pattern 0107, as follows: 

(0107)" = 010?010?010?010?010?010?010?010?010?010?010?010? . . . 
p = T(0107) = 010001010100010001000101010001010100010101000100 . . . 
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For d e S let us denote by ?+ the rotation a ^ a +j: d {so 1 = ?+"). Then 
the pattern 010? can be simpHfied to 0?"^^, because 0?^^ =t 010?, where the 
relation =t is defined by: 

P^T Q if and only if T{P) = T(Q) 

We call the elements of the set (S U 6s)^ Yj-patterns. We continue with a def- 
inition of a composition operation o on E-patterns P, Q such that [P o Q)" = 
P"[Q"]. This then explains the equivalence 0?+^ =t 010? since (0?+^) o 
(0?"^^) = 010? and because the equivalence classes induced by =t are closed un- 
der composition. The idea of composing patterns P, Q is to first take copies P" 
and Q"* such that jP"]? — |Q™| and then fill the sequence of gaps through P" 
by Q"*. Recall that we defined ^(rt, m) = trcd(n m) ™) ~ ' C(™7 "■))• 

Definition 4.4. Let P, Q be E-patterns, and define their Toeplitz composition 
as follows: 

\P'*1[Q'^^] if |P|v>0, di=e(|P|v,|Q|),andd2=e(|Q|,|P|?). 

where u[v\ is defined for all words u,v £ (EU6s)* such that \u\7 — \v\ by 

x[e]=x {ax)[y] ^ ax[y] {f x)[b y] = f (b) x[y] {f x)[g y] ^ {f o g) x[y] 

where a,b eT,, f,g £ ©s, and G (S U 6e)*-'* 

For a E-pattern P and integer fc > we define P'-''^ by p(") = ? and 

pin+l) — p o 

Example 4.5. To compute (0??1??) o (234567) we take di = 3 and da = 2: 

(0??1??) o (234567) = (0??1??)^ [(234567)^] = 023145 067123 045167 

Proposition 4.6. The set of T,-patterns forms a monoid with pattern compo- 
sition as its operation and ? as its identity element. 

Lemma 4.7. The map P i—> P'^ is a monoid homomorphism from ((S U 
6s)*, o,?) to ((SU6s)^>°,('T,r) K^fiM,?"). 

This immediately implies (P^'^))" = T'fe(P), and hence T(P) = limfe_^oo P''"'- 
The length and number of gaps of a composed pattern P o Q are computed 
as follows: 

Lemma 4.8. |P o Q| = ^{\P\,, \Q\) ■ \P\ and \P o Q\, = e(|Q|, jPI?) • IQI? 

The congruence classes induced by —t are closed under concatenation and 
composition. 

Lemma 4.9. Let P be a H-pattern and fc > 1. Then P^ =t P and P^'^^ =t P. 

■^Note that the recursive calls of u[v] preserve the requirement \u\'r = \v\. 
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Example 4.10. The classical Hanoi sequence [17, A101607] is the sequence h 
of moves such that the prefix of length 2^ — 1 of h transfers N disks from peg 
A to peg i? if A*" is odd, and to peg C if is even. We represent moving the 
topmost disk from peg X to y by the pair {X, Y), and map moves to Eg using 

{A,B)^0 {B,C)^2 {C,A)^4: 

{B,A)>-^1 {C,B)>^5 {A,C)^3 

In [2, 1] it is shown that the sequence h is the Toeplitz word generated by the 
pattern 032?450?214? : 

h = r(032?450?214?) 

= 032045032142032045042145032045032142032145042142 . . . 

Now let / — 7^^, the involution on Eg corresponding to swapping Bs and Cs 
in moves {X,Y). Then the above pattern can be simplified to 0/2/4/, that is, 

h = T(0/2/4/) 

as composing the pattern 0/2/4/ with itself yields the pattern 032?450?214? 
from above: 

(0/2/4/) o (0/2/4/) = 032/^450/^214/^ = 032?450?214? 

The sequence of directions obtained by taking h modulo 2 (we took even 
(odd) numbers to represent (counter)clockwise moves) is the period doubling 
sequence p, see [1]: 

{h mod 2) = (r(0/2/4/) mod 2) = T(0?+^0?+^0?+^) = r(0?+^) = p 

The recurrence equations for a Toeplitz word are easy to establish. 

Lemma 4.11. Let P = ai...ar be a Yi-pattern with ai G E, r > 2. Let 

hi < h2 < ■ ■ . < hq be the sequence of indices h such that G 6s (so P has 
q < r gaps). Then for all n G N.' 

T{P){rn + i) = Oi if ^ 1^ i 1^ r and G E , and 

T{P){rn + hj) = f{T{P){qn + j)) for all 1 < j < q with ^ f . 

Example 4.12. Consider the E-pattern P = afbgh for some a,5 G E and 
f,g,h G ©E. With Lemma 4.11 we obtain the following recurrence equations 
for cr = r(P), for ah n G N: 

cr(5n+l) = a cr(5n + 2) = /(cr(3?i + l)) a{5n + 3) = b 

a{5n + 4) = g{a{3n + 2)) cr(5n + 5) = h{(T{3n + 3)) 

Toeplitz words of type (r, 1) can be obtained by iterating an r-uniform mor- 
phism [6], whence they are r-automatic [4]. 
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Proposition 4.13. 

(i) Let P e U 6s)* and define /i : E* E* by h(a) ^ P o a (a £ E). 
Then h^ia) = T{P). 

(a) Let /i : E* — ^ E* he the morphism defined by h{a) = buf{a)v (a gT,) for 
some fixed b £ T,, u,v G E*, and f e ©s- Define the Ti-pattern P by 
P = bufv. ThenT{P)^h'^{b). 

5 Additive Toeplitz Words 

Generated by Single-Gap Patterns 

We characterize additive Toeplitz words of type {r,l). More precisely, we char- 
acterize the set X of one-gap Toeplitz patterns P such that T{P) is additive 
if and only if P is in X. As it turns out, the discrete logarithm to the base g 
modulo a prime number p (see Section 2) plays a key role in the construction 
of these patterns. 

We adopt the following convention: Whenever a E-pattern P generates a 
non-surjective Toeplitz word (i.e., r(P)(N>o) ^ E), we identify gaps / occurring 
in P with all bijections that coincide with / on the letters occurring in T{P). 

Lemma 5.1. Let P = 0102 . . . such that T{P) is non-trivially additive. Then 
oi and = 1^'^ for some d G T,. 

Proof. Let a = T{P). Additive sequences have intial value 0, so a{\) = oi = 0. 
Moreover if G E^ then it follows from the definition of Toeplitz words that 
a/l is a constant sequence: (j{nli) — at for all n e N>o. On the other hand, 
we also have a / i ~ ct by complete additivity of cr, that is, a{n£) = cr(n) -I- a{£) 
for all n e N>o. This combination of facts only occurs if a is constant zero, 
contradicting the assumption that a is non-periodic. Finally, by additivity of a 
it follows that (j/£ — a + a{£) and hence the bijection at position £ has to be a 
rotation for all elements in the image cr(N>o). □ 

We let *P denote the set of one-gap E-patterns generating additive sequences: 

<P = { F e (E U 6s)* I T{P) is additive and \P\9 = 1 } 

Lemma 5.2. Let P £ with \P\ — nm for some n,m > 2. Then the arithmetic 
subsequence T{P)ijn is constant for every i with I < i < n. 

Proof. Let cr = T{P). By Lemma 5.1 we know that the (only) gap of P is at 
the end. Hence G E and from Lemma 4.11 it follows that aim,nm — io'm,m)i,n 
is constant. Moreover {am.m)i,n = (c + o■("^))^,n = <^i.n + o{m) by additivity of 
a. Hence also cr^.n is constant. □ 

Lemma 5.3. Let P e *P with \P\ not a prime power. Then T{P) is the constant 
zero sequence. 
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Proof. Let P be non-trivial and \P\ = nm for some n,m > 1 with gcd(n, m) — 1 
and a = T[P). From Lemma 5.2 it follows that all subsequences f7i.„ and ij^m 
are constant for \ < i < n and 1 < j < m. Since n and m are relatively 
prime we have that for every j < m there is a fc G N>o such that kn + 1 = j 
(mod to). Hence the subsequence cti^„ contains elements of every subsequence 
Cj.m (1 < i < "m), and thus cri,m — <y2,m ~ ■ . ■ = <Jm~i.m- Similarly we obtain 
ci^n = (^2,n — ■ ■ ■ — cTn-i.n- Consequently we have cr{k) = if fc ^ (mod to) 
or ^ (mod n). Hence cr(fc) = if fc ^ (mod lcm(n,TO)). From Lemma 5.1 
we know that the only gap is at position nm, and since lcm(n, m) = mn = \P\ 
we get P{k) = for all 1 < fc < nm. Finally the gap at nm has to map to 
because a{nm) = cr{n) + a{m) =0. □ 

Lemma 5.4. Let P — aia2 . . . G *P with r — for some prime p and k > 2, 
and T{P) ^ 0". Then P = QC^) with Q = oiaz . . . ap_i?+"'' € Vp. (Hence 
P =T Q by Lemma 4-. 9.) 

Proof. With Lemma 5.1 we have that oi, . . . , apk_i e E and a,pk G Ss (*). From 
Lemma 5.2 we know that all subsequences Ui^p with 1 < i < p are constant. 
Hence, using (*), P and Q^^^ coincide on all positions j ^ (mod p). Also, by 
T{P) being additive and the definition of pattern composition (Definition 4.4), 
we have P{mp) = am + ap = Q^'^^mp) for every 1 < to < p. Finally, P{p^) = 
ap + ttpk-i = . . . ~ k ■ Up = Q'^'^^P^). □ 

So far we have shown that all one-gap patterns which generate non-trivial 
additive Toeplitz words have the gap at the end (Lemma 5.1), of prime power 
length (Lemma 5.3), and can always be decomposed to a pattern of prime length 
(Lemma 5.4). Next we give the exact shape of these atomic patterns, which are 
defined using discrete logarithms. 

Definition 5.5. Let g be a primitive root of some prime p > 2. We define 
words Ap_g G Ep_i and A2,i G by 

Xp^g = log^(2) log^(3) . . . log^(p - 1) A2,i - 

We show that every atomic pattern in is of the form (Ap^g 0c)?^'^ for some 
c, d G S. 

Theorem 5.6. Let P G *|5 with \P\ = p for some prime p, '^ij^) non-trivial, 
and g a primitive root modulo p. Then P — {Xp^g P{g)) for some d G E. 

Proof. By Lemma 5.1 we know that the only gap is at the end and that it is 
a rotation. Let 1 < i < p. By Lemma 4.11 and additivity of T{P) we obtain 
T(P)(gV«W)^Ap,<,(z)-P(g). □ 

Example 5.7. Let E E4, and P ^ 002022?+^. This gives rise to the following 
completely additive sequence^: 

T{P) = 0020223 0020223 0020221 0020223 0020221 0020221 0020222 . . . 

^Additivity of T{P) can be checked with Lemma 5.10. 
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Theorem 5.6 gives 

P = (A7,3 P(3)) ?+^ = (021453 2) 7+^ 
P ^ (A7,5 P(5)) ?+^ = (045213 2) ?+^ 

The question remains whether every pattern of the form P = (Xp^g c)7'^'^ 
gives rise to an additive sequence T{P). The following example shows this is 
not the case. Theorem 5.12 formulates the exact requirement for c so that T(P) 
is additive. 

Example 5.8. Let E = E4, and Q = (Ay^g 3) ?+^ = (021453 3) ?+^ = 
023031?+^. Then the sequence T{Q) = 0230313 0230311 ... is not additive: 
T(Q)(8)^r(Q)(4)+r(Q)(2). 

We note that an additive sequence over E„ (with n e N>o) is surjective if 
and only if it contains the element 1^.^- 

Lemma 5.9. Let n^m Cz N>o, cr G T,'^ an additive sequence, and c € E,,,,. If c 
is divisible by ^{n,m), then cr c e EJ^j is an additive sequence. If, moreover, 
the sequence a is surjective, then the converse direction holds as well. 

Proof. For aU i,j £ N we have (tr c){ij) = cr{ij) c = (o'(i) + cr{j)) c = 
a'{i)Qc + a{j))Qc — {cr&c){i) + {aQc){j) since a is additive and by Lemma 2.1. 

For the converse direction, assume that the sequence a is surjective and 
cr c additive. There exist i,j S N such that cr(z) = 1 and cr(j) = rt — 1. Then 
cr(ij) = a{i) + a{j) =0. As a consequence a{ij) c = a{i) c + a{j) c = 0. 
Then {i +n j) c = n c = 0. Hence n -n c = (mod m), and thus c must be 
a multiple of £(n, m) = — — ^ □ 

^ ^\ ^ / gcd(n,m) 

Lemma 5.10. Let G E* such that p = \w\ + 1 is prime, and d G E. Then 
w l'^'^ £ if and only if w{k) — w{i) + w{j) for all i,j, k with < i,j,k < p 
such that k = i ■ j (mod p) . 

Proof. For the direction '=>', let P = w?^''' G Let < i,j,k < p such that 
k = i ■ j (mod p). Then w{k) = T(P){k) = T{P){ij) by Lemma 4.11 since k = 
i ■ j (mod p). Moreover, we have T{P){ij) = T{P){i) + T{P){j) = w{i) + w{j). 
Hence w{k) = w{i) + w{j). 

For the direction '<^', let P = w T^"*, and assume w{k) — w{i) + w{j) for all 
< i,j,k < p such that k = i ■ j (mod p). We show additivity of cr = T{P). 
Let n,m £ N>o, we distinguish two cases: 

(i) Case: p \ nm. There exist n' ,m' ,k' G Ep_i such that n' = n (modp), 
to' = m (mod p) and k' = nm (mod p). We have: 

a{nm) — o{k') by Lemma 4.11 and k' = nm (mod p) 

^w(k') 

= w{n') + w{m') 
= <j{n') + a(m') 

— cr(n) + cr(m) by Lemma 4.11 
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(ii) Case: p \ n or p \ m. We use induction on the exponent of the prime factor 
p in nm. For symmetry assume p \ n (the other case follows analogously). 
We have: 

Tl 

ainm) = a(—m) + d by Lemma 4.11 

P 

TL 

— cr( — ) + crim) + d by induction hypothesis 

P 

— cr(n) + cr(m) by Lemma 4.11 

This concludes the proof. □ 

Lemma 5.11. The Toeplitz word T{X,p,gl'^'^) is additive for every prime p, 
primitive root g modulo p and d £ Sp-i. 

Proof. Let p he a prime, g a primitive root modulo p, d T,, and P = Xp^g 1'^'^ . 
For P G *p it suffices to check Xp,g{k) = Xp,g{i) + Xp^g{j) for all < i,j,k < p 
such that k = i ■ j (mod p) by Lemma 5.10. This property follows from Xp^g 
being the discrete logarithm modulo p. □ 

Theorem 5.12. Let p be prime, g a primitive root modulo p, and c,d € E. 

Then we have that the pattern {Xp^g c) 7'^'^ € ^ if and only if S,{p — 1, |S|) 
divides c. 

Proof First, we observe that (*) T{Xp^g 1+'') c = T{Xp^g c?+'^®'=), and by 
Lemma 5.10 we have (Ap,g c) 7+''®'= e q3 if and only if (Ap,g c) £ *p. For 
the direction '<^=', let c e S such that £,{p- 1, |S|) divides c. Then T{Xp,g ?+'')0c 
is additive by Lemmas 5.11 and 5.9, which implies the claim by (*). For the 
direction let (Ap,g c) G ^. By (*) we obtain that T{Xp^g ?+'') c is 
additive, and by Lemma 5.10 together with surjectivity of T{Xp^g 7^'^) it follows 
that ^{p - 1, IS]) divides c G S. □ 

Theorem 5.13 summarizes the results of this section. 

Theorem 5.13. For every p G P, let g{p) denote a primitive root modulo p. 
Then we have 

'^^{PeiY.yj 6s)* I T{P) is additive and \P[? ^ 1 } 

= { ii\Mp) c) ?+')('=^ I p G P, fc > 1, c, d G S, C(P - 1, \n) I c } 
U {0+?0* } 

For additive (r, l)-type Toeplitz words we can now give the exact genera- 
tor ^ : P S. 

Corollary 5.14. Let T{P) he additive with \P\-! = I, so that P = (Xp^gQc) 

as in Theorem 5.12. Then T{P) — Vp holds, where /i : P — > E is defined for all 

q eP by 



^p.gii) c if q — i (mod p), for some i with 1 < i < p 

d ifq^p 
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The sequence T{P) is infinitely prime generated if and only if c ^ and p > 2, 
by Dirichlet's Theorem on arithmetic progressions [10]. 

Example 5.15. The pattern P — 001011?"''^ derived from Ay^a yields an addi- 
tive binary sequence whose underlying (infinite) prime set is X = P n [3, 5, 6]/ U 
{7}, that is, T{P) = vx- 

Example 5.16. Tocplitz patterns for single prime generated sequences have a 
simple shape: Let p G P, d G S and define fi(p) — d and — for q ^ p. 
Then t;^ = T{0p-^7+'^) by Corollary 5.14 (take c = Os). E.g., for the period 
doubling sequence p = V2 we have p = T{07^^). 

6 Toeplitz Permutations 

There is a strong connection between a Toeplitz pattern P and the arithmetic 
progressions through T{P). For |P|7 = 1 we show that every subsequence 
T{P)a.b is a Toeplitz word T{Pa,b) where Pa,b is derived from P. Thus the 
classification of the arithmetic self-similarity of (r, 1) Toeplitz words is reduced 
to a problem of analyzing patterns. And, using the results of Section 5, we 
conclude that AS{a) = {{b,b) \ be N>o } for all non-periodic additive Toeplitz 
words cr G lf>" of type (r, 1). 

Lemma 6.1. Fix b,r G N>o s.t. gcd(6,r) = 1 and r > 2. Then 3 c, to G N>o 
s.t. cb = r" - 1. 

Proof. Suppose that 6 f r™ — 1 for all to G N>o. Then by the pigeon hole 
principle there must be some p, s,v G N>o such that 

r^ ~ I = V (mod b) 
rP+' -l = v (mod b) 

It follows that: 

b I rP+'' - rP 
b I rP{r' - 1) 
6 I r'' - 1 

So then b divides — 1, contradicting the assumption. □ 

In the sequel we index Toeplitz patterns P (and sequences T{P)) by positive 
integers, i.e., P = P{1)P{2) . . . P{r) with r = |P|. We write P{k) to denote 
P'^{k) = P{k mod r) where we take l,2,...,r as the representatives of the 
congruence classes modulo r. 

Definition 6.2. Let P be a one-gap S-pattern of length r > 2 with its single 
gap / G ©E at the end. Fix a, 6 G N>o such that a < b and gcd(6, r) = 1. Let 
c,m € N>o with to minimal such that cb — r™ — 1 (Lemma 6.1). The arithmetic 
permutation Pa^b of P is defined by 

P^j, (a)p(™) (a + b)... P^") (a + 6(r™ - 1)) 
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Note that the pattern Pa,fc is a permutation of P^"*) (so |-Pa,6|? = 1) because 
6 is a generator of the additive group {1, 2, . . . , r™}. The followmg lemma 
generaUzes [8, Theorem 3.8]. 

Recall that we index Toeplitz patterns P (and sequences T{P)) by positive 
integers, i.e., P = P{1)P{2) . . . P{r) with r = |P|. We write P{k) to denote 
P'^{k) — P{k mod r) where we take 1,2, ...,r as the representatives of the 
congruence classes modulo r. 

Lemma 6.3. Let P G (S U ©s)^ be a pattern of length r > 2 with its single 
gap f at the end. Let a, 6 G N>o such that a < b and gcd(5, r) = 1. Then 

T{P)a,b^T{Pa,b). 

Proof. Let c,m E N>o with m minimal such that 6c = r™ — 1 so that Pa^b is 
the permutation 

Pa,b = P^"'^ {a)P^"'^ {a + b)... P^™) (a + (r™ - 1)6) 

of P(") = P(™) (l)p('") (2) . . . P^™) (r™) as in Definition 6.2. Recall that T(P) = 
T(P(™)) (Lemma 4.9), and that the single gap /™ of p(™) is at the end, 
p(m)(-^m) = /"'. Let j be the index 1 < j < r™ such that Pa,b(j) = 
From Pa^bU) = P(™Ha + (j - 1)6) we infer a + {j - 1)6 = r™ (mod r"). Note 
that j is unique with this property among l,2,...,r™. We also have that 
a + ((ac + 1) - 1))6 = a + ac6 = a + a(r" - 1) = or™ with 1 < ac + 1 < r"\ 
and so a + (j — 1)6 — ar"^ and j = ac + 1. We prove 

r(P)a,b(n) =T(P,^,)(n) 

by induction on n S N. (We tacitly make use of Lemma 4.11.) 

Let n — sr"^ + i for some s £ N and 1 < i < r™. Consider the base case 
n < r™, that is, s = 0. For i ^ j, the claim follows from the construction of 
Pa,b- For i = j we find 

r(p),,,(j) = T(pM)„,,(j) 

= T(p(™))(a + (j--l)6) 

^T{P'-"'^){ar'^') 

= f"'{T{P^"'^){a)) 

= nPa,b)U) 

Now let s > 1. For i ^ j we obtain 

T(P)a,b(rj) =r(P)(a + 6(n-l)) 

= r(P)(a + 6(n-l-r'")) 
= r(P),,b(n-r™)) 

'MT{Pa,b)in~r"') 
= T{Pa,b){n) 
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If i = j = ac + I wc find 



T{P)aAn)=T{P){a + b{n-l)) 
= T{P){a + b{sr"^ + 
= T(P){a + b{sr"' + ac)) 
= T{P){a + acb + sbr"") 
= T{P){a + a{r"' - 1) + s6r™) 
= T{P){{a + sb)r"') 
^r{T{P){a + sb)) 

= r{T{Pa,b){s + l)) 

= T{Pa,,)(sr'-+j) 

= T{Pa,b){n) □ 



Example 6.4. Let P = 0123/ with f{n) ^ n + 2 mod 5 and cr = T{P). 
Then 0-2,2 = ^"(-^2,2) with P2,2 = 1302/, and 0-2,12 = T(F2,i2) with P2,i2 = 
1302/213020130221302413021; for the latter note that c6 = 2 • 12 = 5^ - 1 = 
r™ - 1 and P^^) ^ 012320123401234012300123/^. 

Lemma 6.5. Let Q be a one-gap Toeplitz pattern of length r of the form Q = 
ufv for some f G u £ o.^'d v E Y,* . Then Q ~ Pa,b for P = u^v^ f , 
a = |w| < 6= IQI - 1. 

Proof. Immediate from the definitions. □ 

Definition 6.6. Let P = ufv with u G E+, / G 6s, and v G S*. Let j = \u\ + l 
the index of the single gap /. Let a,5 G N>o s.t. a < j and cb = r = \P\ for 
some c G N>o, and define 

B = P{a)P{a + b)...P{a + {c- 1)6) 



In the first case {{3j'){j =r a + {j' — 1)6)) we have B{j') ~ f. We also note 
that b\B\ = r and b\Pa,b\ = r^- 

Lemma 6.1. Let f,gE ©s such that f o g = g o f , and P = ufv a pattern of 
length r for some u G S+ and v G S*. Define Pf — (gou) f {gov) = g{u) f g{v). 
Thenr(Pf) =5(T(P)). 

Proof. We show T(P3){n) = g{T{P)){n) by induction on n G N>o. Let / = 
|m| + 1 (so P(j) = P^iJ) ~ /)• Let n = rn' + i for some n' G N and 1 < z < r. 
Using the recurrence relations for Toeplitz words (Lemma 4.11) we obtain: 




T(P^)(n) = P^{^) = g{P{i)) = .9(T(P)(*)) 
T(ps)(n) = /(r(P^')(n' + 1)) 1 f{g{T{P){n' + 1))) 
= 5(/(n^)("' + l)))=5(n^')H) 



if i 7^ J 



\i i = j 



□ 
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Lemma 6.7. Let Q be a one-gap pattern, and a,b € N>o s.t. b divides \Q\. 
Then T{Q)a^b — T{{Q^™''^)a,b) where m is minimal such that the (single) gap in 
Q^"^^ is at index j with j > a. 

Proof. Let P = Q(") = ufv of length r = |Q|'" for some -u e E+, / G ©s, 
and u e S*, and let j\u\ + 1 be the index of the gap in P. Note that T{P) = 
T{Q) by Lemma 4.9 and Pa, 6 = Qa,b by definition. Let n g N>o. We prove 
T{P)a,b{n) =T{Pa^b)in). 

Let c e N>o be such that bc = r and define B = P(a)P{a + b) . . . P{a +{c- 
1)6). We distinguish the following cases: 

(i) If j ^ a + (j' - 1)6 (mod r) for ah j' £ N>o, then Pa,b = B, and so 
T{Pa,b) = -S" because B is free of gaps. By Lemma 4.11 we obtain 

r(P)a,6(n) -r(P)(a + (n- 1)6) 
= P(a + (n- 1)6) 
= Pa,b{n) = P(ri) 
^T{PaM){n) 

(ii) Assume there is j' with 1 < j' < c such that j = a + (j' — 1)6 (mod r), 
so that -B(j') = /. Let r' = \Pa.b\ ~ ''^ = We make a further case 
distinction. 

(a) Assume n ^ j' (mod c). Let n ~ r'n' + i for some n' e N and 
1 < z < r', so that i ^ j' (mod c). We prove T(Pa,b)(r'j: + z') = B{i') 
for all i' = i (mod c) by induction on x G N. Using Lemma 4.11 we 
obtain (note that \B o xj? = for all x G S*) 

T(Fa,,,)(r'j; + i') = (S o u){i') = P(i') if 1 < < c|m| 
T{Pa,b){r'x + i') = (Soi;)(i") = S(i') Hi' ^ c\u\ + c + i" 

for some 1 < i" < c|i;| 

For i' = c\u\ + i" for some 1 < i" < c we find with Lemma 4.11 

T{Pa,b){r'x + i') = T{Pa,b){cx + i") 

= T{Pa b){r'x' + i'") for some x' G N, and i'" =c i 

(*) 

= S(z"') 
= 5(0 

The identity (*) holds because i" = i' = i (mod c) and r' = 
(mod c). 

For T{P)afi{n) we find, again with Lemma 4.11 
T{P)a,b{r'n' + i)^ T{P){a + {r'n' + i - 1)6) 
= T(P)(r^n' + a+(i- 1)6) 
= P(a + (i-l)6) =S(i) 
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For the third step, note that we have a + (z — 1)6 j since i j'- 
Hence we have shown T{P)a^b{n) = T{Pa^b){n). 

(b) For n = j' (mod c), let n — tc + j' for some t G N. We have that 

T{P)a,b{n)^T{P){a+{n~\)b) 

= T{P){a+{tc + j' -l)b) 
= T{P)irt + a+U' -l)b) 
= T(P){rt + j) 
= /(T(P)(< + l)) 

We also have 

Pa.bH = G{t + 1) where G ^ {f o u) f {f o v) ^ f{u) f f{v) 
because {B o x){kc + j') = {f o x){k + 1) for all a; G S*, A: G N, and 

We show T{Pa^b)ixc + j') = T{G){x + l) by induction on x. Next we 
distinguish two cases for x. If x ^ \u\ (mod r), then 

T{Pa,b){xC + j') = Pa,b{xc + j') siuCC PaA^C + j') G E 

= G(t+l) since G(x + 1) G S 

^T{G)(x + l) 

If X = |u| (mod r), then x = x'r + \u\ for some x' G N, and 

T{Pa,b)ixc + f) = T{Pa,b)iix'r + \u\)c + f) 
^T{Pa,b)ix'r' + \u\c + f) 

= f{T{Pa,b){cx'+f)) 

^g/(T(G)(x' + l)) 
= T(G)(x + l) 

We have shown T{P)a,bH = f{T{P){t + 1)), and T{Pa,b){n) = 
T{G){t + 1). It thus remains to be shown that f{T{P){t + 1)) = 
T{G){t + 1), which follows from Lemma 6.1. 

□ 

Example 6.8. Let Q = 0.gl2 with 17(71) = n + 1 mod 3. We construct Q2.2 as 
follows: First take Q^^) ^ 011205^1202120012 so that the gap index j = 6 > 
a = 2. Then we find 

Q2,2 = {Bo 01120) {ng^mn){B o 1202120012) where B = 125^22202 

Theorem 6.9. Every arithmetic subsequence of an {r,l)-type Toeplitz word is 
a Toeplitz word. 
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Proof. Let P be a pattern with r = \P\ and |P|? — 1, and let a,b E N>o- 
Without loss of generahty: 

(i) We assume the gap of P to be at the end. For otherwise, using Lemma 6.5 
we have a pattern Q of length r with its gap at the end, and 1 < c < 
r — 1 such that Qc,r-i = P- By Lemma 6.3 we have that T{Q)c.r~i = 
T{P), and hence T{P)a,b = r(Q)c+(r-i)(a-i),f)(r-i)- Then we proceed 
with constructing a pattern generating T{Q)(.+(^r-i)(a-i).b(r-i)- 

(ii) We assume b <r, since T{P) = T{P'^^'>) for all fc > 1 by Lemma 4.9. 

Let h = 6162 such that hi is maximal with gcd(6i,r) = 1. Note that (*) 
hi contains all primes in the factorization of h that do not occur in the prime 
factorization of r. Moreover let 02 G N>o such that 1 < ai < 61 where ai = 
a — (02 — 1)61. Then 

)a2,&2 

by composition of arithmetic progressions. The sequence T{P)a-^^bi is generated 
by the Toeplitz pattern Pai.fci of length r™ with one gap by Lemma 6.3 (for 
some m > 1). Thus it suffices to show that r(Pai,bi)a2.b2 is generated by a 
Toeplitz pattern. By (*) all primes in the factorization of &2 occur in r, so 
also in r™. Hence there exists n > 1 such that &2 divides (r™)" = 7-™". Since 

T{P)a,M - nP^^M), l^irlj? = 1, and jp'^^^.j - r"™ the claim follows by 
Lemma 6.7. □ 



7 Keane Words 

We generalize (binary) generalized Morse sequences as introduced by Keane 
in [16], to sequences over the cyclic additive group E, but restrict to 'uniform' 
infinite block products u x u x u x • • • , or 'Keane words' as we call them. 

Definition 7.1. The Keane product is the binary operation x on S* deffiied 
as follows: 

u X e = e u X av — {u + a){u x v) 

for all u,v eT.* and a e E. Wc define m^") by u^") = and = ux u^^\ 

Let w e E* with \u\ > 2 and u(0) = 0. The Keane word generated by u is 
defined by 

K{u) = lim 

The product u x v is formed by concatenation of \v\ copies of u (so |u x = 
|m| • \v\), taking the ith copy as u + v{i) {0 < i < |w| — 1). As is the identity with 
respect to the x -operation, u x z; is a proper extension of u whenever w(0) = 
and \v\ > 2. Hence K{u) is well-defined and is the unique infinite fixed point 
oi X 1-^ u X X. Also note that K{u) is the iterative limit of the |u|-uniform 
morphism h defined by h{a) = u + a, for all a e E; thus Keane words are 
automatic sequences [4]. 
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Proposition 7.2. (S*, x,0) is a monoid. 

Note that x is not commutative, e.g., 00 xOl = 0011 whereas 01 xOO = 0101. 

Lemma 7.3. Let u e S* with u(0) = and fc = |u| > 2, and letO <i < k. Then 
for all n > we have K{u)(nk + i) = K{u){n) + u(i), and so K{u)i,k ~ K{u). 
Hence (by Lemma 2.6) {i, fc") G AS{K{u)) for all n > and < « < fc". 

Proof. Because of associativity of x we have that u x w^"^ = u^"^ x u, which 
means that we can view as consisting of variants of u, as well as \u\ 

variants of u'^"-\ respectively: 

^ux ^{u + u(")(0)) ■■■{u + - 1)) (1) 

y(n+l) ^ ^(n) X M = + m(0)) • • • + - 1)) (2) 

Let < z < fc and < j < /c". Since we have u'"^ C /^(m) for all n > 0, by the 
use of (1) and (2) we conclude 

((j>| + i) = (u + = + 

Proposition 7.4. T/ie on/y completely additive Keane word is the constant zero 
sequence. 

Proof. Let u = uqUi . . .Uk-i £ S* with uq ^ and k > 2, and assume that 
a = K{u) G S'* is additive (note that, as tr is indexed 0, 1, . . ., additivity of a 
means (T(nm—1) = cr(n—l)+(T(m—l) for all n, m > 0). Then cr/j^/j+i = a-\-a{k), 
that is, 

CTfe [n) = a{k + {k + l)n) = a{k{n + 1) + n) = cr(n) + cr{k) [n G N) 

(3) 

Using the recurrence relations for Keane words (Lemma 7.3) we obtain 

cr(fc(i + 1) + i) = (T(i + 1) + (0 < i < fc) (4) 

Combining (3) and (4) gives a{i + 1) = cr{k) for all < i < fc. Hence ui = U2 = 

... = Uk-l. 

To see that = for all < * < fc, we show ui = ui + ui in each of the 
following three cases: For A; > 4 this is immediate: U3 = mi + ui follows from 
additivity of a. For fc = 3 we get cr(3) = + wi = Ui from equation (4) and 
(7(3) = Ui+Mi by additivity. Finally, for k — 2,we get cr(8) = ui and cr(2) = ui, 
since by Lemma 7.3 we have K{v){£"') = vi for all blocks v — vqVi . . .vi and 
n G N. Moreover, by additivity, a{8) = a{2) + a{2) =ui+ui. □ 

In what follows we let {n)k denote the base fc-expansion of n, and \w\a the 
number of occurrences of a letter a G S in a word w G S*. 
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Theorem 7.5. Let u = uqUi . . . Uk-i S S* be a k-block with uq = and k > 2. 
Then 

K{u){n) = I Wfel^ ('^ ^ 0) 

Proof. We prove (Vn e N)(|(n)fe| = r =^ K{u){n) = Y.^e^, K")^!^ © "0 by 
induction on r G N>o. If r = 1, then n G Sfe and if = u„. For r > 1, let 

(n)fc = nr-inr-2 • • ■ niriQ and define n' = . Then (n')fc = n,,_in,,_2 • • • f^l 
and = ^ ^ 1; we find 

i4:(u)(n) ^ K{u){n'k + 710) 

= K{u){n') + u„„ (Lemma 7.3) 

= I ^ \{n')kU QUi\ + Una 
ViGSfc / 

Example 7.6. For the Morse sequence m = if (01) = 01 x 01 x ■ • • , Theorem 7.5 
gives another weU-known definition of m, due to J.H. Conway [7]: m{n) is the 
parity of the number of Is in the binary expansion of n. Likewise, for the 
generahzed Morse sequence w = if (GDI) discussed in [16, 14], and called the 
'Mephisto Waltz' in [14, p. 105], we find that w{n) is the parity of the number 
of 2s in the ternary expansion of n. 

Let X = xqXi . . . Xk-i with fc > 1 be a word over E. The first difference A(a;) 
of X is defined by A(x) = e if jxj = 1 and A(a;) = (xi — xq){x2 — xi) . . . {x^-i — 
Xk-2), otherwise. 

We give an embedding of the Keane monoid (Prop. 7.2) into the monoid of 
Toeplitz pattern composition (Prop. 4.6). 

Let S = { u e 1;+ I Mo = }, and define the map At : B (S U ©s)* by 

At{u) = A{u)7^'^ for all u G B and d — uq ~ M|„|_x — ~W|u|-i 

Theorem 7.7. At : x,0) ^ ((E U 6s)+,o,?) is an infective homomor- 
phism: 

At(0) = ? At(u X w) = At(m) o At(w) 

Proof. We first show that At preserves structure. The preservation of identity 
is immediate. For c e S let At,c : E+ — > (E U ©s)"*" be defined by At.c{x) = 
A{x) for all a; e E+ (so that ATix) ^ At,c{x) iff c = -x\^\_i, for ah x £ B). 
We prove the following equation, for all u, u G E+ and d e E: 

AT.d'{u ^ v) — At{u) o AT.d(t') where d' ~ du+ d and du — — M|„|_i, 

by induction on the length of v. The claim then follows by taking d = dy = 
vo - Wit,!-! = -W|t,|-i since At(u x u) = ATA^j^dS^ ^ 
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If \v\ = 1, then V ^ a for some a G S, and 

AT,d' (u X a) = Ar^d' {u + a) 
= A(u + a)?+'*' 
= A(u)?+''' 
= (A(u)?+''")o?+'' 
= At(u) o AT,d(a) 

If |w| > 1, then V = aa'v' for some a, a' G S and w' e S*, and we find 

A.T4'{u X aa'w') = Ay.d' ((w + a)(u x a'w')) 
= A((u + a)(u X a't.'))?+'^' 

= A(m + a){uQ + a' ~ + a))A(u x a'w')?^'^' (5) 

= A{u){du + a' - a)AT,d'{u x a'u') 

= A(w)(d„ + a' - a)(AT(M) o AT,d(a'«')) (IH) 
= (A(u)?+'^")o((a'-a)AT,d(aV)) 
= AT(7i) o AT^dio-av') 

where (5) follows from A{xy) — A{x){yo — xix\-i)A{y) for all x,y E S+, and 
(■u x a'w')(0) = uo + a'. 

Finally, we show that At is injective by defining a retraction Jj, : At{B) — J> 
B. Let P be a E-pattern P G At(B). Then P has the form P — 0102 . . . a„ 
where a.; G S for 1 < i < n and a„ — 1'^'^ with c? = ~X)i<i<n'^j- Define 
/j,(P) = MqUi • • ■ Un-i by Uq = 0, = + (1 < i < n - ij. 

We show that is a left- inverse of A^- Let u — u^ui . . . u„_i e B. Then 
AT(it) = A(u)?^'' with d ~ —Un-i- Let A{u) = 0102... a„_i. Then a.^ = 
Ui — Ui-i {1 < i < n) and so /j,(At(u)) = w. □ 

It follows that the first difference sequence of a Keane word is a Toeplitz 
word. Here the difference operator A is extended to — ?► in the obvious 
way. 

Theorem 7.8. A{K{u)) = T{At{u)), for all blocks u G B. 

Proof. Let u — uqUi . . . u^^i be a fc-block over E with uq = and fc > 2. Let 
n G N>o be a positive integer and {n)k ~ nr-inr-2 • ■ .f^o the base /c-expansion 
of n. 

The first nonzero digit of {n)k (reading from right to left) is ria with a = Vk (n) 
and so 

{n)k = rir-i nr-2 ■ ■ ■ na ... 

(n - l)k = rir-i nr-2 ■ ■ ■ (na - 1) (fc - 1) (fc - 1) • ■ • (fc ~ 1) 
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Let K — K{u). From Theorem 7.5 we then obtam 



A(K)(n) — K,{n) — K{n — 1) 

= (u„^ + a • uo) - + a ■ Uk-i) 



whence 



A(«)(n) 



ui — uq if n = 1 (mod k) 

U2 — ui if n = 2 (mod k) 

Uk-i — Uk-2 if n = fc — 1 (mod fc) 

, A(K)(n//c) — if n = (mod /c) 



From Lemma 4.11 we infer that this is exactly the recurrence equation of the 
Toephtz word generated by the pattern At{u) — A(u)?^"'°"\ □ 

Example 7.9. The first difference of the Morse sequence m is the conjugate 
of the period doubling sequence p: A(m) = A(ii:(01)) = T(A(01)?+^) = 
T(l?+^) =p+l. 

The first difference of the Mephisto Wahz w [16]: A{w) = A(/-i:(001)) = 
T(A(001)?+^) = T(01?+^) turns out to be the sequence of turns (folds) of the 
alternate Terdragon curve [9] ! Moreover we also know A{w) — s/2, where 
s = r(00?+hl?+^) is the sequence of turns of the Sierpihski curve [11, 12] ! 
Note that T{On~^^) is the additive sequence i'[2]3u{3} derived from A3^2, see the 
construction in Section 5. 

We conclude with a complete characterization of the arithmetic self-similarity 
of the Thue-Morse sequence m, and leave the characterization of the entire class 
of Keane words as future work. We know that at least {i, G AS{K(u)) for 
all n > and < z < by Lemma 7.3. 

The Arithmetic Self-Similarity of the Thue— Morse Sequence 
Theorem 7.10. AS{m) = {{a,b) \ < a < b = 2"' for some m > } . 

We have the following recurrence equations for m = _ftr(01) = 01101001 . . . : 



m{2n) ~ m{n) m(2n + I) — m{n) (6) 

Carry-free addition a tt) /3 of binary numbers a = Up . . .Uq and (3 — j3q . . . is 
defined by: 



attl/3 



a + /3 if there is no i < min(p, q) with ai ^ /3i ~ 1 
undefined otherwise 



Carry-free addition preserves the total number of ones: \a W /?|i — \a\i + \f3\i. 
Hence we have that Tn{x + y) ^ m{x) + m{y) whenever (x)2 W {y)2 is defined. 



26 



Lemma 7.11. Let x > with x for any k > 0. Then m{xy) = 1 for some 
y > with m{y) — 0. 

Proof. Without loss of generality, we let x be an odd number. Otherwise, if 
x = 2x\ then m{2x'y) — m[x'y). Put differently, trailing O's of {x)2 can be 
removed, as they do not change the parity of the number of I's in {xy)2. 

Let {x)2 = Xn ■ ■ ■ xiXg be the binary representation of x (with a;„ — Xq = 1). 
Note that n > for otherwise a; = 2°. We distinguish two cases: 

(i) If xi — 0, then we take {y)2 ~ j/„ . . . yo with y„ = yo — ^ and yi — for 
< « < ri. So y = 2" + 2*^ and xy = a;2" + a; ; in binary representation: 



Xn Xn — 1 • • ' X\ Xq 

1 ... 1 

Xn Xn^\ ... X\ Xo 

Xn ■ ■ ■ Xi Xq ... 

Xn ■■■ 1 Xn-1 ■■■ Xi Xo 



To see that m{xy) = 1, i.e., that {xy)2 has an odd number of I's, observe 
that {xy)2 (the bottom line) contains the subword Xn-i ■ ■ ■X2 twice. These 
cancel each other out, and, as xi = 0, the only remaining I's and 
Xq on the outside and the created 1 at position n + 1. That makes three. 

(ii) In case xi = 1, we take y = ?>y' with y' the result of case (i) for 3a;. 
First of all note that (3a;)2 ends in 01 and so case (i) applies. Secondly, 
we get that m{'ixy') = 1, and hence m{xy) = 1. Finally, to see that 
m{y) = 0, note that y' — 2™ + 1 for some m > 0, and so m{y) = 
m(32/') = m(2™+i+2™ + 2 + l) = l-m(2™ + 2™-i + l) by the recurrence 
equations (6) for m. To see that m(2™ + 2™^^ + 1) ~ 1, one distinguishes 
cases m—1 (then m(4) — 1) and m > 1 (then we again have three I's in 
the binary expansion). □ 

Proof of Theorem 7.10. The direction O' follows from Lemma 7.3. 

For the direction 'C', assume (a, 6) e AS{m). We distinguish two cases: 
(i) 6 is a power of 2 and a > b, and (ii) b is not a power of 2, and show that 
they lead to a contradiction. 

(i) Let TO > and a > b = 2™. Consider fc > 1 such that bk < a < b{k+l) and 
let a' = a — bk. Then ma,b{n) = m{a + bn) = m{a' + b{k + n)) = ma\b{k + 
n), in short: rUa^b — {'ma',b)k.i (*)• As we have that a' < b, it follows 
from (D) above that (a', b) G AS{m), and so ma\b ~ fri {**)■ Combining 
(*) and (**) with the assumption ma f, ~ m, we obtain i ~ m. 

Suppose rUs^i — m for some s > 1 (the proof of rUs^i ^ m is analogous).^ 
This means m(s + n) = m{n) for all n. So m{s) = m(0) = 0. Now let p 
be maximal such that 2^ < s and (5)2 — SpSp^i . . . sq (so Sp — 1). Then 

^We may also conclude this case by using the fact that m is non-periodic: No positive 
iteration > 1 of the shift operator of m equals m or its conjugate, because a sequence 
a g yl'* such that (7^ i = a with A; > 1 is fc-periodic, see Lemma 2.7. 
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(s + 2^)2 10sp_i . . . So, and hence m{s + 2^) = m{s) = 0. But the 
assumption m^.i = m tells us m(s + 2^) = in{2P) = 1. 

(ii) Assume & > is not a power of 2, and let a > be arbitrary. By 
Lemma 7.11 there exists y > such that m,{y) = and m{by) — 1. 
Let m be minimal such that a < 2™. Then also m{by2"^) = 1, and, since 
(a + &2/2")2 = (0)2 W (62/2'")2, we get m(a + 6?/2'") = m(a) + 1 (mod 2). 

From the assumption {a,b) G AS{m) it follows that either (Vn)(m(a + 
bn) — m{n)), or (Vn)(m(a + fori) = m(n)). In the first case we have 
m(a) = m(0) = and m(a + 6y2™) = m{y2™) = 0. In the second case, 
we have m{a) — m(0) = 1 and m{a + &y2™) = rn{y2"^) = 1. Both cases 
contradict m(a + 62/2™) = m(a) + 1 (mod 2). □ 
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